A Shrinking-Based Dimension Reduction Approach for Multi-Dimensional Data Analysis
نویسندگان
چکیده
In this paper, we present continuous research on data analysis based on our previous work on the shrinking approach. Shrinking[2] is a novel data preprocessing technique which optimizes the inner structure of data inspired by the Newton’s Universal Law of Gravitation[1] in the real world. It can be applied in many data mining fields. Following our previous work on the shrinking method for multidimensional data analysis in full data space, we propose a shrinking-based dimension reduction approach which tends to solve the dimension reduction problem from a new perspective. In this approach data are moved along the direction of the density gradient, thus making the inner structure of data more prominent. It is conducted on a sequence of grids with different cell sizes. Dimension reduction process is performed based on the difference of the data distribution projected on each dimension before and after the datashrinking process. Those dimensions with dramatic variation of data distribution through the data-shrinking process are selected as good dimension candidates for further data analysis. This approach can assist to improve the performance of existing data analysis approaches. We demonstrate how this shrinking-based dimension reduction approach affects the clustering results of well known algorithms.
منابع مشابه
A Dimension Reduction Approach Using Shrinking for Multi-Dimensional Data Analysis
In this paper, we present ongoing research on data analysis based on our previous work on the shrinking approach. Shrinking [22] is a novel data preprocessing technique which optimizes the inner structure of data. It can be applied in many data mining fields. Following our previous work on the shrinking method for multi-dimensional data analysis in full data space, we propose a shrinking-based ...
متن کاملA Shrinking-Based Approach for Multi-Dimensional Data Analysis
Existing data analysis techniques have difficulty in handling multi-dimensional data. In this paper, we first present a novel data preprocessing technique called shrinking which optimizes the inner structure of data inspired by the Newton’s Universal Law of Gravitation[22] in the real world. This data reorganization concept can be applied in many fields such as pattern recognition, data cluster...
متن کاملPrincipal Component Multi Linear Analysis for Content Based Image Retrieval
In the process of content based Image retrieval (CBIR), image information is presented in descriptive features to obtain retrieval of image information. In the representation of descriptive features a large feature count is observed, which results in the overhead in processing. To reduce these descriptive features different dimensional reduction logic were used in which PCA is the most commonly...
متن کاملDifferenced-Based Double Shrinking in Partial Linear Models
Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...
متن کاملA Chance Constraint Approach to Multi Response Optimization Based on a Network Data Envelopment Analysis
In this paper, a novel approach for multi response optimization is presented. In the proposed approach, response variables in treatments combination occur with a certain probability. Moreover, we assume that each treatment has a network style. Because of the probabilistic nature of treatment combination, the proposed approach can compute the efficiency of each treatment under the desirable reli...
متن کامل